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Thm SPADE theory uses linguistic formalisms to modal the progratt planning 
and debugging processes. The theory has bean applied to constructing a 
grannar-based editor In which programs arc written in a structured fashion, 
designing an automatic programing system based on an Augmented Transition 
Network, and parsing protocols of programing episodes. 

The SPADE theory begins with a taxonomy of haste .planning concepts 
covering strategies for identification, decomposition and re farm let ion + A 
handle is provided for recognizing Interactions between goals and deriving a 
linear solution. 

A complementary theory of rational bugs and associated repair techniques 
is also provided. SPADE introduces a new data structure to facilitate 
debugging -- the derivation tree of the program. The SPADE editor makes this 
structure available to the programer. 

The SPADE theory generalizes recant work in Artificial Intelligence by 
Sussaan and Sflctrdotl oh automatic programming, and extends the theory ef 
program design developed by the Structural) Programing movement. It provides 
m more structured information processing model Of human problem solving than 
thtt production systems of Newell and Simon, and articulates the type of 
problem solving curriculum advocated by Xpert's Logo Project. 


This report describes research done at the Artificial Intelligence Laboratory of 
enu nassaenusetts institute of Technology, The autonatic problem solving aspect of 
this research was supported by the Advanced Research Projects Agency of the Departaent 
of Defense under Office of Naval Research contract N0Dtflfl-75-c-0643, the educational 
aspect by the National Science Foundation undar grant C497QBX, and the protocol 
analysts aspect by the Intelligent Instructional Systems Group at Dolt Bara nek and 
Newman, undar contract number 31DA S03-76-C-0IQA jointly sponsored hy Advanced Research 
Projatts Agency. Air Force Human Resources Laboratory* Army Research Institute, and 
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] Ttili is an expanded version of a paper subsetted to the Firth International Joint 
Conferonce on Artificial Intelligence. 
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I. A Hultl-F ac 0 t ed Approach 


TMi Paper provides an overview of three separate contexts in which the SPADE 
theory is beinsi simultaneously developed and tested -- computer uses in education , 
automatic programming {a traditional Al arenaJ, and protocol analysis (the domain Of 
information processing psychology), Our experience has been that d powerful synergism 
results from this multi 'faceted approach. 

1 ■ The Education Context: an editor has been implemented. that encourages students 
to define and debug programs in terms of explicit SPADE design choices. rhn editor 
provides a structured programming environment based on a detailed theory of the 
processes involved in coherently Structured problem solving. 

2+ The A1 Context: ap automatic programmer called PATN has been designed using an 
augnonted transition network embodiment of the SPADE theory, this results in a unified 
framework which Clarifies recent work oh planning and debugging by Sacerdotl [1973} and 
Sussman [1975], 

3, The [Psychology Context: a parser called PAZAIN has he*n designed that applies 
the SPADE theory to the analysis of programing protocols, PAZAHtf produces a parse of 
the protocol that delineates the planning and dabugging strategies employed by the 
problem salver, PAZATN extends the series of automatic protocol analyzers dnvnloped at 
Cat-tieffle-Mollon University [Waterman S Newell I97Z, 1973: Bhaskar h Sinotl 1976). 

Hand-simulations of PATN and PAZATN on elementary programming prtibiaias and 
informal experiments with the SPADE editor attest to the theory's cogency in accounting 
for a wide range of planning and debugging techniques [Goldstein ft Biller 1976a,b; 
Hiller ft Goldstein 1976b,c,d] r 


g_. A linguistic Anal ogy 

In dovoloping a rapresontlop for problem solving techniques, we have been guided 
by an analogy to computational linguistics, for three reasons., 


!■ The concepts and algorithms of computational linguistics, though originally 






intended to explain the nature of language per se, supply perspicuous ynt powerful 
descriptions of C Diap lex corcputa Cions in general. ProbHim solving, ft A a complex 
process, benefits from the application iff these tools- 

Z * Computational linguistics decomposes complilations into syntactic, semantic, and 
pragnatic components. This decomposition clarifies the explanation of complex, 
processes when viewed In the following manner; syntax formalizes the range of possible 
decisions; semantics the problem description, and pragmatics, the procedural 
relationship between the two. 

3. Computational linguistics has undergone nn evolution gf procedural formalisms, 
beginning with finite State automata, later employing recursive transition networks 
(context free grammarsh next moving on to augmented transition networks, and 
culminating In the current set of theories Involving frames [FT in shy 10 75, 
Winograd 1975, Schank 1975]. tech phase captured some properties of language, but was 
Incomplete and required generalization to core powerful and elaborate formalisms. 
Following this evolutionary sequence illuminates the complexity of language theory, W* 
have pursued a similar evolutionary approach to clarify the complexity of problem 
solving processes. 

To date, our theory of design has evolved through the foil owing stages: #» 
initially explored context free graranars for planning and debugging* and subsequently 
their generalization to ATll‘s; we examined the notaphor of protocol analysis as 
parsing, initially using the planning and debugging graitpnars to reveal the const!tuent 
structure of protocols and later using the derivations produced by the ATN formalism; 
and, finally, our most recent work has studied the use of a chart-basari parser to 
discover these analyses. 


3, A Grammatical Theory of Planning 

The basis for SPAEtE is a taxonomy of frequently observed planning concepts 
(fig. 1). We arrived at this taxonomy by introspection, by examining problem solving 
protocols [Niller b Goldstein 1976b], by studying the literature on problem solving 
[Polye 1957, 1965, 1960; Newell A Sierjn 1972; Sussman 1975; Eacerdoti 1975], and by 

enumerating techniques for finding procedural solutions to problems expressed as 
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FIGURE 1 

TAXONOMY OF PLANNING CONCEPTS 























predicate calculus formulae [Fndrrc ft; Kowalski 1976]. this last criterion demonstrates 
that the taxonomy Is currently incomplete -- for example, techniques, rur handling 
disjunctions have not yet boon analyzed thoroughly trough to warrant inclusion. 
However h the taxonomy Is adequate for a wide range of elementary prograwring problems. 
Future research, will investigate additional planning techniques. 

There are three major classes of plans lh the taxonomy; Identification* 
decomposition., and reformulation. Identification means recognizing a problem as 
previously solved. Decomposition ref or S to Strategies for dividing a problem into 
simpler sub-problems. Reformulation plans alter tins problem description, seating m 
representation which is more amenable to identification or decomposition. The figure 
indicates ho-w these classes of plans aro rurther subdivided in the SPADE theory. 

Planning, according to the theory, is a process in which tho problem solver 
selects, the appropriate plan type* and then carries out the subgoals defined hy that 
plan applied to the current problem. From, this viewpoint, the planning taxonomy 
represents a decision tree of alternative plans, the decision process con bo modeled 
by the contact free granmar given below, the gracmar explicitly States which planning 
rules involve recursive application of solution techniques to subgoalst setup, 
interface, moLustep, cleanup, and parallel. 

{The rules of the grainnar are written using the following syntax; m \* is 
disjunction, is ordered conjunction, "ft" Is unordered conjunction, 

H 

■< T ..> ‘ is iteration, [Is optionality, and a lower case English phrase 
enclosed In quotation marks (e.g,, "repeat step"| describes a lexical item 
which is not further expanded in the grammar,) 1 


PLAN 


O IDENTITY i DECOMPOSE | REFORMULATE 
*> PAIR HIVE l DEFINED 
-> "call user subprecedure" ft FLAM 
-> CONJUNCTION \ RET EXIT ION 
-> SEQUENTIAL | PARALLEL 

-> {SETUP ] * < MINSTER + ( INTERFACE]>* + [CLEANUP] 
-> <FiAM>* 

-> PLAN 


IDENTIFT 


DEF INFO 


DECOMPOSE 

CONJUNCTION 

SEQUENTIAL 


PARALLEL 


SETUP 


HAINSTEP 

-> 

PLAN 

INTERFACE 

-> 

PLAN 

CLEANUP 

-> 

PLAN 

REPETITION 

-> 

ROUND E RECURSION 

ROUND 

-> 

ITER-PLAN \ TAIL-RECUR 

ITER-PLAN 

-> 

'repeat Step" + SEQ 

TAIL-RECUR 

-> 

'stop step" * SF'J + "r 


The SPADE th eery is not restricted to any particular domain, However * to provide 
concrete «Kimpi«s h we have concentrated on problems Iron elementary Logo sirophics 
programing [Paport 15711, This domain was chosen because of tho aval Lability of 
extensive date on the per foment; e of students writing *turtlc p programs to draw simple 
pictures, The gracanar rules for primitives in this dooairi are; 


PRIKITIVE 

VECTOR 

ROTATION 

PF-NSTATE 


■> VECTOR | ROTATION ] PEN5TATE 
■> (FORWARD | BACK) * (nuabor? 
-> (IE FI | RIGHT 3 + (number? 

■> PENUP ! PENDOWN 


A typical task undart ah on by beginners in the logo environment is to draw a wishingwnli 
picture using the computer - 


Fig. 2 — A Logo Hl&hingwell 

Hg. 3 illustrates e solution to the wishtngwelt problem with its hierarchical 
annotation according to our planning gnine-jr. 

The graraaar characterizes the decision process Involved In selecting plans from 
the taxonomy. Kb Illustrate its utility In the nett bra Sections by constructing nn 
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edit or that embodies the grammar and analyzing debugging U terns or the S™ar. ™en 
WB *huw how the grammar tin be augmented to include not only the syntax or plans, blit 

,hoii* semantics add pragmatics as wall, 

4 , 57ADE-Q, ft Pi ar m inn Assistant 

3ni reason for calling o»r theory of planning and debugging structured is to 
edphajijo the link betw**n our research and the Structured Programming movement, Dahl, 
Dljfcstra. and Hoars [1&7Z] call for a style of programming which reflects coherently 
structured problem solving. But a detailed formalization of what this style entails i* 
lacking. Our efforts in this direction, therefore, supplement the work of Dijfcttra and 
othars* The 5frADE-0 editing environment serves as an example- 

How can wa judge whether a particular grantwr of plans captures, at some level of 
abstraction, the sot of planning decisions involved in solving problems for some 
domain? One Qa theology WO employ is f incorporate the grammar Into an editor (SPADE- 
0 ) whose purpose is to augment and direct the capabilities of e human user, Tt» 
critical Question then becomes determining the extent to which such a system aid* or 

hinders the user, 

suppose a problem solver is defining a Logo program for drawing the wlshingwell 
shown earlier - Itl SPADE-0, this it accomplished by applying the planning gransar in 
generative mode: 

le. What is the nans of your top level procedure? 

lb. >wv 

Za. The rule is- PLAN *> WWim I DECOMPOSE 1 REFfflWJLATE. 

What now? 

£b, >CtCOhPOSE, 

Sa. The ruls is: DECOMPOSE -> CON-UNCTION l REPETITION 

3t>, >COWJUNCTION 


4a, The rule is; CONJUNCTION -> SEQUENT! At. 




The rule is: SEQUENTIAL -> {SETUP } + OUIH&TEF + [ INTERFACE*■ {CLEANUP]. 
Pc you Vi'iih tc (iflfinfl an optional SETUPT 

4b. Later. 


Sfl , * + * 

In this way, SPADE - D- eACoUripes users to articulate their design decisions In top- 
down order. At the sane tice, the System, allows the user to escape from this Strict 
discipline if «n alternative solution order scons preferable. This was illustrated by 
the user's "later 1 instruction. "Later 1 suspends tlie current goal for Later solution. 

SPADE-0 was impler-ented by assigning an interpretive procedurR to each grammatical 
operator. In essence, the- editor is a bookkeeper for the user's goal tree. Despite 
its simplicity. wo sso throo useful applications of the program. 

1 . From Uil educational standpoint, the editor encourages students to articulate 
their problem solving strategies. The fundamental hypothesis of the Logo Project* as 
presented by Rupert in feeofliflp Children TAfatfirp [1971], Is that such articulate 
problem solving is beneficial to the learner. The SPACE editor, with its extreme form 
of articulation, provides an experImenta] vehicle for evaluating Papert's claim} do 
students exposed to SPADE progress Faster in learning Lego than controls whose problem 
solving is more tacitT 

Z. From an AI standpoint* its use will indicate whether the planning grammar is 
adequate, or whether certain plans are not present that competent problem solvers feel 
are necessary. 

3. From a psychological standpoint, we will collect transcripts of individuals 
using tha ad iter end formulate fiersoflm! uraranurj based on the particular rules usually 
employed &y eAoh user, The personal grammar will modal the prohlau solving Shills of 
that individual. In the past we have manually analyzed protocols from Standard Logo. 
SPADE protocols* with their explicit planning choices, can be far more revealing. 


5, RAID, A Pefrugg In y Assistant 


5PAUE includes a theory of debugging.. Such a theory is essential, since problem 
solvers oust often formulate Mens in the face of imperfect knowledge and United 
resources. Under such circumstances, oven carefully reasoned Judgments nay bo 
mistaken. 

Given a grammatical theory Of planning, debugging can bo analyzed as the 
localisation and repair of errors in applying gramaar rules during planning. since our 
planning rules were constructed from operators for conjunction, disjunction, ind 
optionality, three basic classes Of errors arise; 

l. syntactic bugs, in which the basic grammar is violated, such as when 4 
required conjunct is missing; 

2+ Semantic bugs., in which a semantic constraint arising iron the particular 
problem is violated, such as when A syntactically optlortei constituent, needed 
because of the semantics of the particular problem. Is missing; 

3, pragmatic bugs, in which an inappropriate selection from a sot or nutually 
exclusive dlj/blflcls is made. 

Fig. 4 provides examples of these bug types- Although additional categories 
(including execution and efficiency bugs) must be defined tg make this taxonomy of bugs 
complete, these Classes are adequate to characterize many examples which irlie In 
elementary programing. 

RAIlO is a debugging assistant for SPADE-11 implemented oh, the basis of this 
taxononiy of errors. As 5PA&E-0 prompts the user through planning process, those 
choices which might lead to syntactic, semantic, or pragmatic bugs are recorded by the 
systea. In subsequent debugging episodes, RAID alerts the user to the relevant bug 
warnings, thereby aiding in diagnosis, Jlorenvor, the SPADE-u planning editor can than 
be re invoked to guide repair of tho bug by resuming planning in the context of tti* 
culpable decision,. An interaction with PA Ip that would arise from premature execution 
of a program is illustrated taalgw; 




FIGURE A ~ SYNTACTICALLY INCORRECT I’I .AM 
ft NECESSARY CONJUNCT IS MISSING 

TO WW 

10 TRIANGLE -USE--- ... 

-ID-PLAN 


END 


ww GET 

??? TRIANGLE UNDEFINED ??? 


'"GET" MISSING. UNGRAMMATICAL PlAN 
DEBUG BY COMPLETING FLAN.} 


GET TRIANGLE FILE 


FIGURE 4 - SEMANTICALLY INCORRECT PLAN 
AN OPTIONAL CONJUNCT IE MISSING 


FOR EXAMPLE: 


"WW r| MISSING INITIAL FETUP, AND ! NTERFftCF 


TO WW 


10 KELL -—MAINSTEF—| 
20 POLE —MAINSTEP 


l seo-PLAN 

r 

FIGURE r, - PRAGMATICALLY 
AN INCORRECT DISJUNCT MAS 



Atari:- hOn: 


INCORRECT PLAN 
BEEN SELECTED 


FOR 


TO SQUAPE- INSIDR-TRIANGL E 
10 SQUARE 
20 TRIANGLE 
END 


INTENDED PICTURE: 



LINEAR PLAN -- 
SQUARE AND TRIANGLE 
DESIGNED 
INDEPENDENTLY* 


ACTUAL. PICTURE: 



r-nr.l . 















































fAff flier has CflJTti need 1 ItffifllflfUJel 1 propron freffua €ar tier, mi t A 

ftJ carrejtt ie/taftton us Shaun in /iff, 4b r AttAutlff* <fl* initial SETUP dfttf flff 
tKTERFACE hast net Jf#l Ueeff supplied, tAs fljor re ^ti e-i t i ejcecirttow of ifta 
prwrtM, 

la. ... What now? 

lb. >RUN WISH!NOWELL, 

2ft. OK, (Ftf 4b ihotti tht retuLting pi ctLTftJ 

What ti owl 
2b. JOiognosa 

la. The rule IS: DIAGNOSES <PARSE f COPE \ MODEL | PROCESS**, 
what ti&wr 
3b, JParie 

4a, The rule i*: PARSE -> SVNTM 1 SEMANTICS I PRAGMATICS. 

¥hftt now? 

4a, >£em.antici 

5ft. The following optional constituents are Hissing In WW; 

The SETUP and the INTERFACE between WELL and POLE, 

What now? 

5b, >Repalr 

6&. The rule Is: REPAIR -> COMPLETE | REPLACE. 

6b. ^Complete interface between well and poll. 

7ft, The rule is; PLAM -> IDENTIFY [ DECOMPOSE I REFORMULATE 
7b. > ♦*« 

In this cAemjfla, the interaction with SPATE wes again guided by a grammar; thii 
dabuaging grammar is given below. 


DIAGNOSE 

PROCESS 

CODE 

MODEL 

PARSE 

SYNTAX 

SEMANTICS 

PRAGMATICS 

REPAIR 

COMPLETE 

CORRECT 

RUN 

ASK 

TRACE 

SELF-DOC 


-> <PARSE | CODE | MODEL ( PROCESS>" 

-> <ASK 1 TRACE>* 

-> -print definition" 

-> RUN 

-> SVNTAA 1 SEMANTICS | PRAGMATICS 
-> "review conjunctive choices in plan* 

-> "review optional chfflcu In plan" 

-> "review disjunctive choices in plan" 

-> COMPLETE | CORRECT 
-> -solve for missing conjunct" 

-> "chouse altarhative disjunct" 

-> "run code line" + ["error nsg"] + [DEELfGl 
-> '"print definition" l "print value" 

*> [SELF-DOC*] + RUM* 

-> "add breai" f 'add print" [ "add trace" 


This grammar represents the traditional dehuBSluff strategies employed by programer* 
supplesented by the PARSE-based strategies allowed by the existence of the SPADE 
da rivet1on tree. 

For more complex programs* there will be too many potential bug locations for 
RAID 1 s undirected advice to ba helpful* Hence* following SPADE-O^s structured 
hpaAAeepfnp philosophy* we allow the user to set caveats during planning regarding 
those decisions which appear to be potential trouble spots* in this fashion* SPADE 
permits an exploratory style of problem solving* as illustrated below. 

Suppose the mlshfapmeH program is part of C large project, Thee the fttuirber 
of optional constituents that Jtaire not been dtf tncd will fie large. The user 
will probably finite s cmc tfls At into srMth o/ these Is actually needed, To 
/acifltote defiujtfiffj, tfte user is encouraged ta piece ceaeata at those 
decisions Ac suspects way rruufrc later rfefiapjrlnp. 

5a* Do you wish to include the optional initial SETUP in WW? 

Sb* >»□- 


7a, Po you wish to record a caveat here? 

7b. >Tos, 

Ha. OK + Four pending seals are HOOF, POLE and WELL. What now? 

9b, > ... 

Armthar mechanism to aake SPADE 1 ? and HA CD’s advice more directed Is to supply ID# 
semantics for problems. This is the topic of the next section. 

5. PATK — An Augmented Transition Network for Flapping 

While context free grammars CHI represent a useful abstraction af planning 
decisions, they have limitations which prevent then from providing i Complete theory of 
design. they provide no representation for the semantics of the problem nor for th# 
pragmatics Involved in choosing one plan over another. For this rot son, W* have 
designed and are currently implementing PATH. nr augmented transit ten network (ATN) 
problem solver, Vb have adopted the AIN formalism for the Same reasons that lnri Woods 
[1970] to introduce it into computational linguistics: the semantic and pragmatic 
lie itat ions of context free grammars. 

Fig, S provides a global view of PATH [Goldstein a Miller 1976b], the topology of 
tho notwork embodies the planPing grammar, Registers Contain descriptions oT the 
problem, the solution, and various tenporary constructs built during planning. Arc 
predicates supply pragmatic guidance by examining the registers and appropriately 
directing the planning process. For example, an identification plan cannot proceed if 
the problem description cannot be found in the answer library. PATN has bn on 
successfully hand-simulated on elementary Logo and Blocks World problems, 

FAIN allows u? to elaborate our notion of a completed plan by defining An 
ffimtatetf dariuption tree. Associated with each node of the plan derivation is a 
snapshot of the vaiuas of the AIN register? at the point iti the planning process when 
that node was created. A derivation tree reveals the constituent structure of th* 
plan; these semantic uarljfciej reveal the semantic intent. A set of itsstrtfa« at 
each node, derived, as instances of PATH'S arc predicates, reuaal thB prapraaElc reasons 
for preferring a given plan ever Its competitors. Fig. 6 shows the annotated 
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FIGURE 5 A SIMPLIFIED VIEW OF PATN 
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FIGURE 6 PATH'S AiuictaU’d DERIVATION TREE FOR THE WISHI11GWELL TASK 


































derivation trco for the >rish in11 progrim given earlier 


An. in ten dud: application of PATW Is as a module of the SPADE editor for providing 
the user with additional direction and advice. The following hypothetical dialogue 
With SPADE- 1 (the SPADE editor augmented by PATH | illustrates this., 

la. Tell me the name Of your toplevel procedure? 

lb, >WW 

In, Would you like to define a Riddel? 

Zb. >WISHIKtVFLL 

Ha, CHL, WISHfNC>W£LL modal found in Hndfll Library, The rule is; 

PLAN -> IDENTIFY ! DKOfifOSt | F.E FORMULATE, What now? 

3b, >ROOF POLE WELL 

da. These Sdhgeals correspond to model parts. 

Are you pursuing 9 LINEAR PECOfirdSITION plan? 

Ab . >YEfl 

5u. the pending subyoaU are: ROOF t POLE, WELL and the preparatory stops. 

What now? 

5h, >R«JF 

fin. In tht WW model, ROOF Is d-escrlibed as a triangle, 

Thera is a TRIANGLE procedure in the answer library. 

Would you lika to use it? 

6b, >Y£5 

7a, OK,, The ROOF is complete, Your pending goals art: 

■ POLE, WEIL, and the preparatory Steps. What new? 

7b. > 

PATN's Linearization Loop provides a Lheoretleal superstructure for handling 
dependent, interacting subgoals, The approach is to Haw a npn-linearity a; 9 property 


of the- problem description. That is. even though one description of the problem leads 
to 00 unsuccessful refinement process because of Interactions between subgcals, another 
description cun succeed by making the dependent constraints explicit. For example, 
consider the SQUARE-INBIDE-TR.JANDLE or fig. 4C, Jf the problem description is; 

SQUARE Jt; 

TRIANGLE Y; 

INSIDE X Yt 

then a decomposition that draws the Square and triangle independently and then attempts 
to fit them together to achieve the inside relation will fell. However, a problem 
description of the following form Allows a successful decomposition; 

SQUARE X, WITH STUE = 10tt E 

TRIANGLE Y, WITH BIDE = 3(0; 

CENTER flF * i CENTER OF Y. 

The INTERACTIONS predicate is a con junction of tests on the model register. Each 
tas,t is responsible for detecting a given non-linearity. A corresponding action 
nodi fits the model, adding new statements to make the interaction explicit, The 
REFINEMENT loop is the repository for what Sussran [1375} calls the Critics Gallery. 
The theoretical progress of PATH is to integrate the Critics Gallery concept into a 
tboory of planning. In Sussman's HACKER, the critics gallery and library of 
programming techniques were 5 operate modules; there Was no integrated theory. 

Of course, at any point In time the system nay be unaware of a given type of non - 
linearity r In such cases, the absence or nn interaction test will lead to a sequential 
do compos i i ion that Ultimately fails. The design of a pro grain for debugging such 
failures is the subject of the pent section. 

T, DA PR - - An Augmented Transit ion Network for D ebugg ing 

PAIN Can make mistakes. That is, PATN will sometimes introduce what we term 
rational hugs into its plans, duo to making arc transitions with imperfect know lodge of 
subtleties or interactions in the task domain. Hence, PATN ms-t be equipped with a 





COHpleoeiVtary debugging module, CAPR (Tig, 7), 


Vhile RAID was designed to assist human progr (Miners in finding a variety of bugs 
[primarily by plan diagnosis), MPR was difSinrted tp analyze PATH'S bugs. DAPR employ* 
three diagnostic technique*i model, process, and plan diagnosis r Model diagnosis 1* 
the basic technique. It amounts to comparing the effects of executing a plan to m 
formai description of its goals, tD detarraine if, and in what fashion* the plan hi* 
Tailed. Another DA PR technique, based on Eussmatt's HAftfR [1975], is examining the 
state of the process at tha time of the error nan i fast at ion. Plan diagnosis is a DAPft 
first. It is accomplished by examining the CKireirts variable associated with various 
nodes of PAf^s annotated plan derivation. 

DAPR will also be used to provide additional guidance to RAID* This illustrates 
tfrio synergism possible when educational, psychological and AI facets of a cognitive 
theory am studied ip an integrated fashion. This integration is further exemplified 
in the next section when vh apply the SPADE theory to protocol analysis. 

6. PAZATW, an A utocr atic Protocol A n a lyzer 

As soon as one has an heurlstically adequate theory of design, it is natural to 

ask, ‘Can the theory provide an account of how people solve problems?"'. An 

experimental technique we employ for answering this question is the analysis Of 

protocols Collected during problem solving sessions. By adopting this methodology we 
follow the precedent established In seminal protocol analysis studies conducted at 
Carnegie Mellon University [Rowell A Simon 19/Z 3 Waterman A heWe 11 1972 , 1073; 

Bhaskar ft Simon IMG]. Our wprk extends their approach along three dimnnsions. 

L. With the exception of the recent Bhaskar ft Simon effort, the CMU Studios have 
been restricted to very limited domains such as cryptarHhmetlc, Ratbar than limit in a 
the task domain, We limit the rahne of responses. Typically protocols are 
trunsorlpt ions Of tbink*a|oud verbalizations: w e focus on the more restricted 

interactions arising from a problem solving session at a computer console. The 
analysis task in this setting is to interpret user actions -- milting, executing* 
tracing, etc. -- in terms of the SPADE, theory of planning and debugging. 
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2, The CHU theory centers on tSim p roduct i on Systems modef,. Although productions 
are Turing universal* they tend to mult in a less structured program organization 
than the linguistic formalisms of the 5TADE theory. In PATH* each arc transition* 
consisting of a predicate and an action, can be thought of as a production, However, 
PATH organizes these productions into local contexts, each of which consists of the 
area exiting from a given node. Not all of the arc productions are present at any 
moment in time; an arc is present only when tha problem solver is at the relevant 
node* In the production systems discussed in fluiftd.it Problem Solving [Newell ft 
Slfeon t97£j* all of the productions are always present atid are tested in serial order* 

CHU analyses aro based on the problem iehejjior ftrvph. Pursuing an analogy to 
computational linguistics, we define an interpretation of a protocol to be a perse tree 
supplemented by semantic and prattle annotation, Tha parse tree characterizes the 
constituent structure of the protocol. Semantic and pntymntie annotation -- variables 
and assertions attached to nodes of the parse tree ** formalist the problem description 
and the rationale for particular planning choices. Annotated parse trees closely 
reflect the local structure of PATtt's linguistic problem solving machinery, leading 
more directly to 1F1 Ter eEiceS regarding individual differences than Is evident from 
problem behavior graphs. 

ttuven Brooks [1975] applied the CBU approach to the programming domain, 
developing a model of eerf-iirji -- the translation of high level plans into the statements 
of a particular programming language -- and testing tha model by analyzing protocol*, 
His nods I is a set of production rules whose conditions match the patterns of plan 
elements and whose actions generate code Statements. Protocols Are analyzed manually. 
With the experimentrr attempting to infer the plan which is than expanded by the 
production systeu into code paralleling that of the protocol. The processes of 
understanding tha problem, generating the plan, and debugging are not formalized, 
SPACE goes beyond this in that it can be used to parse protocols and that tha parse 
constitutes a formal hypothesis regarding not only the coding knowledge but also the 
planning and debugging strategies Employed hy tho problem solver- 

[Killer a Goldstein i?7eb) provides a detailed example of such analysis being 
performed by hand. The example Is A segment from a protocol several hundred lines long 
in which n high school student use; tha Logo turtle %a draw the letters of his name. 


By eserairtino tha grammar rules present m the derivation, we can readily observe 
various properties of the student's problss solving such as: reliance on certain 
planning choices to tha exclusion of others (e.g., the student employad iteration, but 
never recursion!: the misuse of certain optional constituents (e.g,, a setup was 
usually included in each procedure even wheh it was unnecessary): and certain 
situations whore his problem solving violates the grammar and hence is susceptible to 
syntactic errors fe.g., programs were often executed before their subprocedures fieri 
been defined). 

Just as a context free grammar Is incomplete as a theory of planning, likewise a 
parse is only a partial analysis of a protocol. The theory of annotation developed in 
the FATN work led us from describing only the syntactic structure to more comp lute 
analyses or protocols; an interpretation of a protocol is the selection of a 
particular a a note ted PATH plan derivation. Fig, ft shows such an analysis of a 
Simplified protocol In which a wish 1 ngwo 11 program IS defined, executed and debugged, 

PAZATN is a chart-based parser [Kay 1971; Kaplan 3575] being implemented to 
Interpret protocols in terms of PATH'S annotated plan derivations [Filler ft 
Goldstein )976d]. it will operate by causing PATN to deviate from its preferred 
approach in response to hottom-Up evidence (fig. 5). By taking advantage of powerful 
parsing strategies developed in research on speech understanding [Lesser et at. 1973j. 
PftKtcrn ft Robins op 1975], as well as the economical chart representat Ion of ambiguities, 
PAZATW has been successfully hand-simulated op approximately 10 Logo protocols, 

PAZATff will Operate by matching PATN-generated plans with protocol data. Two 
charts will be used to represent alternative interpretations. the PL AN CHART keeps 
track of the set of plausible subgoals which have been proposed by PATH. Fig. ID shows 
a plan chart for a wishtngwall in which PATN has proposed two alternative 
decompositions. the structure is a chart because it shares substructures, as 
OMeniplif led by the cannon solution to the IrfElL SUbgoel pointed to by both wish in swell 
decompositions. The &ATACHART records the Stele of partially completed 
interpretations. Fig, 11 shows how the detacher! links events into the pltnchmrt for a 
PAZATN Interpretation of the wishingwell protocol given earlier. 


These charts are grown as follows. First PAZATN requests PAIN to generate its 
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mast plauiibie plan. This plan is inserted into the PLANCHART. Then protocol events 
are analyzed one by one, anil matched with subgoals In the PATN plan. The natch la 
recorded In the DATACHART, If no plausible matches are found, PATH is naked to 
generate the next most plausible plan. The PtAtfCHART is thereby extended. Common 
sub goal s share the same structure in the chart. 

At first* PAZATN Kill be implemented interactively. Kith the User -- a 
psychologist analyzing a protocol -- direction PA2AIN to select different PATN plans. 
This follows the incremental implement a tioc strategy used in two of the CNU protocol 
analyzers [Waterman A Newell 1972; Bhaskar & Sinon 1976]. PAZATN, even in it? onrly 
interactive stage*. should provide strong evidence regarding PATN 1 * adequacy as ■ 
cognitive theory. 

PAZATH Kill also he tested in the SPADE contest. Below is a hypothetical dialogue 
with SPADE-2, representing the original SPADE-0 augnnted by bath PATH and PAZATW. 

la. We are now solving for a WlSHlPflGWELL- Ytrnr pending subgoals are; 

ROOF, POLE,, WELL, and the interfaces. Miat now? 

lb, >SOUARt 

2a. OK, I a» assuming that WEU has been solved hy identifying It 
with the previously solved SQUARE procedure. What nowT 

Zb, > ... 

PAZATN will Increase the editor's flexibility In hand Hug ambiguous events, and in 
alleviating what might seem to some users to be an excessive allocation of time and 
effort to the planning phase, tlltldnfcaly we expect SPAOE-Z to provide a progranmlng 
environment of value to professional as well as Student programmers, 

9. CoitcIns 1 ans 

An outline Of the SPADE theory or design has been presented. Ons theme has been 
the aynorolsm achievable in a multi-faceted research effort. Programs, have been 
designed and arc being implemented for testing SPADE In educational, psychological and 
A1 contexts. The AI dimension involve. 1 ; the development of a spectrum of possible 



theories of cognition. The psychological and educational dimensions supply feedback 
regarding which parts of a theory are rejected by competent human problem Solvere, 
Furthermore, the psychological dimension would be incomplete ware it not to address the 
Issue of learning, while the educational dimension characterizes the trajectory of th# 
student's cognitive state through time, 

A second ttotnec has been the exploitacten of concepts and algorithms fro® 
coin,putstianal UngUtitles: gratmars, ATN's, derivation trees, search strategies from 
speech understanding, chart-based parsers. Computational linguistics is also 
responsible for suggesting the propitious decomposition of' problem Solving processes 
into CPmpCUIonts Involving syntactic, semantic and pragmatic knowledge. 

He believe that our unified approach to AI, psychology end educalien represents n 
now research paradigm offering the potential for considerable progress in all three 
fields. But much remains to be done. Although all of the programs have been designed 
*nd hand-simulated, ohly the SPADE'Q Editor has been implemented. Furthermore, while 
SPADE has been applied to Lugo graphics, blocks world and elementary calculus problems. 
It has not yet been exercised in enough contests to prove its generality. If this 
research project succeeds, a new clarity Will have bean brought to the Study of problem 
solving. Even If it fails, the reasons fur the failures should provide useful 
insights. In any case, it has already unified the treatment of plans and hugs, a 
significant stride for a theory of problem solving. 
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